Overview

Dataset statistics

Number of variables24
Number of observations9426
Missing cells72
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.0 MiB
Average record size in memory1004.3 B

Variable types

CAT11
NUM11
DATE2

Warnings

Customer Name has a high cardinality: 2703 distinct values High cardinality
Product Name has a high cardinality: 1263 distinct values High cardinality
City has a high cardinality: 1424 distinct values High cardinality
Order ID is highly correlated with Row IDHigh correlation
Row ID is highly correlated with Order IDHigh correlation
Product Sub-Category is highly correlated with Product CategoryHigh correlation
Product Category is highly correlated with Product Sub-CategoryHigh correlation
State or Province is highly correlated with RegionHigh correlation
Region is highly correlated with State or ProvinceHigh correlation
Row ID has unique values Unique
Discount has 848 (9.0%) zeros Zeros

Reproduction

Analysis started2020-09-20 10:59:11.922808
Analysis finished2020-09-20 11:00:32.337709
Duration1 minute and 20.41 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

Row ID
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct9426
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20241.01528
Minimum2
Maximum26399
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum2
5-th percentile4040.5
Q119330.25
median21686.5
Q324042.75
95-th percentile25927.75
Maximum26399
Range26397
Interquartile range (IQR)4712.5

Descriptive statistics

Standard deviation6101.890965
Coefficient of variation (CV)0.3014617044
Kurtosis2.999815155
Mean20241.01528
Median Absolute Deviation (MAD)2356.5
Skewness-1.936310388
Sum190791810
Variance37233073.35
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
184311< 0.1%
 
197551< 0.1%
 
60411< 0.1%
 
238811< 0.1%
 
218321< 0.1%
 
259261< 0.1%
 
197791< 0.1%
 
238731< 0.1%
 
218241< 0.1%
 
259181< 0.1%
 
197711< 0.1%
 
74811< 0.1%
 
218161< 0.1%
 
259101< 0.1%
 
197631< 0.1%
 
238571< 0.1%
 
218081< 0.1%
 
197871< 0.1%
 
259341< 0.1%
 
218401< 0.1%
 
239051< 0.1%
 
259661< 0.1%
 
34351< 0.1%
 
239131< 0.1%
 
46661< 0.1%
 
Other values (9401)940199.7%
 
ValueCountFrequency (%) 
21< 0.1%
 
271< 0.1%
 
521< 0.1%
 
531< 0.1%
 
621< 0.1%
 
631< 0.1%
 
641< 0.1%
 
661< 0.1%
 
671< 0.1%
 
681< 0.1%
 
ValueCountFrequency (%) 
263991< 0.1%
 
263981< 0.1%
 
263971< 0.1%
 
263961< 0.1%
 
263951< 0.1%
 
263941< 0.1%
 
263931< 0.1%
 
263921< 0.1%
 
263911< 0.1%
 
263901< 0.1%
 

Order Priority
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
High
1970 
Low
1926 
Not Specified
1881 
Medium
1844 
Critical
1804 
ValueCountFrequency (%) 
High197020.9%
 
Low192620.4%
 
Not Specified188120.0%
 
Medium184419.6%
 
Critical180419.1%
 
Critical 1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length13
Median length6
Mean length6.748992149
Min length3

Overview of Unicode Properties

Unique unicode characters23
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
i1118617.6%
 
e56068.8%
 
o38076.0%
 
d37255.9%
 
t36865.8%
 
c36865.8%
 
H19703.1%
 
g19703.1%
 
h19703.1%
 
L19263.0%
 
w19263.0%
 
18823.0%
 
N18813.0%
 
S18813.0%
 
p18813.0%
 
f18813.0%
 
M18442.9%
 
u18442.9%
 
m18442.9%
 
C18052.8%
 
r18052.8%
 
a18052.8%
 
l18052.8%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5042779.3%
 
Uppercase Letter1130717.8%
 
Space Separator18823.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
H197017.4%
 
L192617.0%
 
N188116.6%
 
S188116.6%
 
M184416.3%
 
C180516.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
i1118622.2%
 
e560611.1%
 
o38077.5%
 
d37257.4%
 
t36867.3%
 
c36867.3%
 
g19703.9%
 
h19703.9%
 
w19263.8%
 
p18813.7%
 
f18813.7%
 
u18443.7%
 
m18443.7%
 
r18053.6%
 
a18053.6%
 
l18053.6%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
1882100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin6173497.0%
 
Common18823.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
i1118618.1%
 
e56069.1%
 
o38076.2%
 
d37256.0%
 
t36866.0%
 
c36866.0%
 
H19703.2%
 
g19703.2%
 
h19703.2%
 
L19263.1%
 
w19263.1%
 
N18813.0%
 
S18813.0%
 
p18813.0%
 
f18813.0%
 
M18443.0%
 
u18443.0%
 
m18443.0%
 
C18052.9%
 
r18052.9%
 
a18052.9%
 
l18052.9%
 

Most frequent Common characters

ValueCountFrequency (%) 
1882100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII63616100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
i1118617.6%
 
e56068.8%
 
o38076.0%
 
d37255.9%
 
t36865.8%
 
c36865.8%
 
H19703.1%
 
g19703.1%
 
h19703.1%
 
L19263.0%
 
w19263.0%
 
18823.0%
 
N18813.0%
 
S18813.0%
 
p18813.0%
 
f18813.0%
 
M18442.9%
 
u18442.9%
 
m18442.9%
 
C18052.8%
 
r18052.8%
 
a18052.8%
 
l18052.8%
 

Discount
Real number (ℝ≥0)

ZEROS

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04962762572
Minimum0
Maximum0.25
Zeros848
Zeros (%)9.0%
Memory size73.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.02
median0.05
Q30.08
95-th percentile0.1
Maximum0.25
Range0.25
Interquartile range (IQR)0.06

Descriptive statistics

Standard deviation0.03179842508
Coefficient of variation (CV)0.6407404065
Kurtosis-0.9872078038
Mean0.04962762572
Median Absolute Deviation (MAD)0.03
Skewness0.07204455118
Sum467.79
Variance0.001011139837
MonotocityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
0.018989.5%
 
0.038829.4%
 
0.058799.3%
 
0.098719.2%
 
0.028709.2%
 
0.048619.1%
 
0.088509.0%
 
08489.0%
 
0.18388.9%
 
0.068218.7%
 
0.078038.5%
 
0.171< 0.1%
 
0.161< 0.1%
 
0.211< 0.1%
 
0.111< 0.1%
 
0.251< 0.1%
 
ValueCountFrequency (%) 
08489.0%
 
0.018989.5%
 
0.028709.2%
 
0.038829.4%
 
0.048619.1%
 
0.058799.3%
 
0.068218.7%
 
0.078038.5%
 
0.088509.0%
 
0.098719.2%
 
ValueCountFrequency (%) 
0.251< 0.1%
 
0.211< 0.1%
 
0.171< 0.1%
 
0.161< 0.1%
 
0.111< 0.1%
 
0.18388.9%
 
0.098719.2%
 
0.088509.0%
 
0.078038.5%
 
0.068218.7%
 

Unit Price
Real number (ℝ≥0)

Distinct751
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean88.30368555
Minimum0.99
Maximum6783.02
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum0.99
5-th percentile2.88
Q16.48
median20.99
Q385.99
95-th percentile320.64
Maximum6783.02
Range6782.03
Interquartile range (IQR)79.51

Descriptive statistics

Standard deviation281.5409817
Coefficient of variation (CV)3.188326511
Kurtosis276.5284612
Mean88.30368555
Median Absolute Deviation (MAD)17.01
Skewness14.13514588
Sum832350.54
Variance79265.32438
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
6.482983.2%
 
65.992192.3%
 
4.981531.6%
 
125.991311.4%
 
5.981181.3%
 
2.88921.0%
 
30.98860.9%
 
35.99780.8%
 
20.99780.8%
 
205.99700.7%
 
19.98690.7%
 
115.99660.7%
 
6.68640.7%
 
4.91610.6%
 
4.13590.6%
 
4.28580.6%
 
85.99580.6%
 
55.99580.6%
 
150.98570.6%
 
100.98560.6%
 
5.28540.6%
 
3.28490.5%
 
40.98480.5%
 
195.99470.5%
 
155.99460.5%
 
Other values (726)725376.9%
 
ValueCountFrequency (%) 
0.992< 0.1%
 
1.14130.1%
 
1.26140.1%
 
1.48130.1%
 
1.660.1%
 
1.68240.3%
 
1.790.1%
 
1.74100.1%
 
1.76340.4%
 
1.83< 0.1%
 
ValueCountFrequency (%) 
6783.0270.1%
 
3502.1460.1%
 
3499.9980.1%
 
2550.1470.1%
 
2036.4880.1%
 
1938.0280.1%
 
1889.993< 0.1%
 
1637.532< 0.1%
 
1500.9760.1%
 
1360.144< 0.1%
 

Shipping Cost
Real number (ℝ≥0)

Distinct652
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.79514216
Minimum0.49
Maximum164.73
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum0.49
5-th percentile0.8
Q13.1925
median6.05
Q313.99
95-th percentile55.285
Maximum164.73
Range164.24
Interquartile range (IQR)10.7975

Descriptive statistics

Standard deviation17.18120278
Coefficient of variation (CV)1.342791081
Kurtosis7.646014433
Mean12.79514216
Median Absolute Deviation (MAD)3.63
Skewness2.544518927
Sum120607.01
Variance295.1937288
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
19.993984.2%
 
8.993663.9%
 
1.992782.9%
 
0.52172.3%
 
0.991671.8%
 
41581.7%
 
1.491551.6%
 
0.71551.6%
 
24.491491.6%
 
2.991421.5%
 
351241.3%
 
2.51221.3%
 
301041.1%
 
13.991001.1%
 
6.5880.9%
 
1.39840.9%
 
5800.8%
 
3.99720.8%
 
4.2600.6%
 
49580.6%
 
4.5560.6%
 
1.25550.6%
 
5.26520.6%
 
5.99500.5%
 
60450.5%
 
Other values (627)609164.6%
 
ValueCountFrequency (%) 
0.49370.4%
 
0.52172.3%
 
0.71551.6%
 
0.71240.3%
 
0.731< 0.1%
 
0.7580.1%
 
0.7680.1%
 
0.7880.1%
 
0.793< 0.1%
 
0.8250.3%
 
ValueCountFrequency (%) 
164.731< 0.1%
 
154.121< 0.1%
 
147.122< 0.1%
 
143.711< 0.1%
 
1301< 0.1%
 
1261< 0.1%
 
110.2120.1%
 
9970.1%
 
91.0580.1%
 
89.3140.1%
 

Customer ID
Real number (ℝ≥0)

Distinct2703
Distinct (%)28.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1738.422236
Minimum2
Maximum3403
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum2
5-th percentile181
Q1898
median1750
Q32578.75
95-th percentile3238.75
Maximum3403
Range3401
Interquartile range (IQR)1680.75

Descriptive statistics

Standard deviation979.1671968
Coefficient of variation (CV)0.5632505017
Kurtosis-1.183463844
Mean1738.422236
Median Absolute Deviation (MAD)837.5
Skewness-0.04771704704
Sum16386368
Variance958768.3993
MonotocityIncreasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1193270.3%
 
699260.3%
 
2107220.2%
 
2491220.2%
 
2882210.2%
 
308210.2%
 
3079200.2%
 
272190.2%
 
1999190.2%
 
1723180.2%
 
1129180.2%
 
1959170.2%
 
1821170.2%
 
1413170.2%
 
1314170.2%
 
1106170.2%
 
3075160.2%
 
553160.2%
 
2548160.2%
 
640160.2%
 
1402160.2%
 
3151160.2%
 
1745160.2%
 
102150.2%
 
1796150.2%
 
Other values (2678)896695.1%
 
ValueCountFrequency (%) 
21< 0.1%
 
360.1%
 
52< 0.1%
 
64< 0.1%
 
71< 0.1%
 
81< 0.1%
 
91< 0.1%
 
101< 0.1%
 
111< 0.1%
 
121< 0.1%
 
ValueCountFrequency (%) 
34032< 0.1%
 
34024< 0.1%
 
340070.1%
 
33994< 0.1%
 
33983< 0.1%
 
339760.1%
 
339670.1%
 
33944< 0.1%
 
339350.1%
 
33911< 0.1%
 

Customer Name
Categorical

HIGH CARDINALITY

Distinct2703
Distinct (%)28.7%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Louis Parrish
 
27
Jenny Gold
 
26
Leigh Burnette Hurley
 
22
Sean N Boyer
 
22
Glen Caldwell
 
21
Other values (2698)
9308 
ValueCountFrequency (%) 
Louis Parrish270.3%
 
Jenny Gold260.3%
 
Leigh Burnette Hurley220.2%
 
Sean N Boyer220.2%
 
Glen Caldwell210.2%
 
Andrew Gonzalez210.2%
 
Andrew Levine200.2%
 
Priscilla Kane190.2%
 
Eleanor Swain190.2%
 
Pam Patton180.2%
 
Constance Flowers180.2%
 
Vanessa Boyer170.2%
 
Bonnie Matthews Rowland170.2%
 
Maxine Collier Grady170.2%
 
Keith Marsh170.2%
 
Pamela Wiley170.2%
 
Herbert Holden160.2%
 
Glenda Hunter160.2%
 
Neal Wolfe160.2%
 
Kristine Connolly160.2%
 
Wesley Tate160.2%
 
Wayne Bass160.2%
 
Gordon Brandt160.2%
 
Carlos Johnson150.2%
 
Caroline Johnston150.2%
 
Other values (2678)896695.1%
 
Frequencies of value counts

Unique

Unique897 ?
Unique (%)9.5%
Histogram of lengths of the category

Length

Max length28
Median length13
Mean length13.18544452
Min length6

Overview of Unicode Properties

Unique unicode characters54
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e1245910.0%
 
104298.4%
 
a102408.2%
 
n102358.2%
 
r89537.2%
 
l67765.5%
 
o67525.4%
 
i66815.4%
 
s45623.7%
 
t43233.5%
 
y36312.9%
 
h29912.4%
 
d28202.3%
 
c25342.0%
 
u20891.7%
 
B18591.5%
 
m17571.4%
 
M16281.3%
 
C15411.2%
 
S14991.2%
 
H14051.1%
 
g12951.0%
 
J12451.0%
 
R11831.0%
 
k10910.9%
 
Other values (29)1430811.5%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter9350575.2%
 
Uppercase Letter2026316.3%
 
Space Separator104298.4%
 
Other Punctuation890.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
B18599.2%
 
M16288.0%
 
C15417.6%
 
S14997.4%
 
H14056.9%
 
J12456.1%
 
R11835.8%
 
G10815.3%
 
L10615.2%
 
K9724.8%
 
D9674.8%
 
W9354.6%
 
P8914.4%
 
A8654.3%
 
E7173.5%
 
T6363.1%
 
F6023.0%
 
N4552.2%
 
V2321.1%
 
O2101.0%
 
Y1170.6%
 
I850.4%
 
Z490.2%
 
X130.1%
 
U120.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e1245913.3%
 
a1024011.0%
 
n1023510.9%
 
r89539.6%
 
l67767.2%
 
o67527.2%
 
i66817.1%
 
s45624.9%
 
t43234.6%
 
y36313.9%
 
h29913.2%
 
d28203.0%
 
c25342.7%
 
u20892.2%
 
m17571.9%
 
g12951.4%
 
k10911.2%
 
w10801.2%
 
b9711.0%
 
v6630.7%
 
p5230.6%
 
f4920.5%
 
z3040.3%
 
x2010.2%
 
j690.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
10429100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
'89100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin11376891.5%
 
Common105188.5%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e1245911.0%
 
a102409.0%
 
n102359.0%
 
r89537.9%
 
l67766.0%
 
o67525.9%
 
i66815.9%
 
s45624.0%
 
t43233.8%
 
y36313.2%
 
h29912.6%
 
d28202.5%
 
c25342.2%
 
u20891.8%
 
B18591.6%
 
m17571.5%
 
M16281.4%
 
C15411.4%
 
S14991.3%
 
H14051.2%
 
g12951.1%
 
J12451.1%
 
R11831.0%
 
k10911.0%
 
G10811.0%
 
Other values (27)1313811.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
1042999.2%
 
'890.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII124286100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e1245910.0%
 
104298.4%
 
a102408.2%
 
n102358.2%
 
r89537.2%
 
l67765.5%
 
o67525.4%
 
i66815.4%
 
s45623.7%
 
t43233.5%
 
y36312.9%
 
h29912.4%
 
d28202.3%
 
c25342.0%
 
u20891.7%
 
B18591.5%
 
m17571.4%
 
M16281.3%
 
C15411.2%
 
S14991.2%
 
H14051.1%
 
g12951.0%
 
J12451.0%
 
R11831.0%
 
k10910.9%
 
Other values (29)1430811.5%
 

Ship Mode
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Regular Air
7036 
Delivery Truck
1283 
Express Air
1107 
ValueCountFrequency (%) 
Regular Air703674.6%
 
Delivery Truck128313.6%
 
Express Air110711.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length14
Median length11
Mean length11.40833864
Min length11

Overview of Unicode Properties

Unique unicode characters20
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
r1885217.5%
 
e1070910.0%
 
94268.8%
 
i94268.8%
 
u83197.7%
 
l83197.7%
 
A81437.6%
 
R70366.5%
 
g70366.5%
 
a70366.5%
 
s22142.1%
 
D12831.2%
 
v12831.2%
 
y12831.2%
 
T12831.2%
 
c12831.2%
 
k12831.2%
 
E11071.0%
 
x11071.0%
 
p11071.0%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter7925773.7%
 
Uppercase Letter1885217.5%
 
Space Separator94268.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
A814343.2%
 
R703637.3%
 
D12836.8%
 
T12836.8%
 
E11075.9%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
r1885223.8%
 
e1070913.5%
 
i942611.9%
 
u831910.5%
 
l831910.5%
 
g70368.9%
 
a70368.9%
 
s22142.8%
 
v12831.6%
 
y12831.6%
 
c12831.6%
 
k12831.6%
 
x11071.4%
 
p11071.4%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
9426100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin9810991.2%
 
Common94268.8%
 

Most frequent Latin characters

ValueCountFrequency (%) 
r1885219.2%
 
e1070910.9%
 
i94269.6%
 
u83198.5%
 
l83198.5%
 
A81438.3%
 
R70367.2%
 
g70367.2%
 
a70367.2%
 
s22142.3%
 
D12831.3%
 
v12831.3%
 
y12831.3%
 
T12831.3%
 
c12831.3%
 
k12831.3%
 
E11071.1%
 
x11071.1%
 
p11071.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
9426100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII107535100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
r1885217.5%
 
e1070910.0%
 
94268.8%
 
i94268.8%
 
u83197.7%
 
l83197.7%
 
A81437.6%
 
R70366.5%
 
g70366.5%
 
a70366.5%
 
s22142.1%
 
D12831.2%
 
v12831.2%
 
y12831.2%
 
T12831.2%
 
c12831.2%
 
k12831.2%
 
E11071.0%
 
x11071.0%
 
p11071.0%
 

Customer Segment
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Corporate
3375 
Home Office
2316 
Consumer
1894 
Small Business
1841 
ValueCountFrequency (%) 
Corporate337535.8%
 
Home Office231624.6%
 
Consumer189420.1%
 
Small Business184119.5%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length14
Median length9
Mean length10.26702737
Min length8

Overview of Unicode Properties

Unique unicode characters20
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e1174212.1%
 
o1096011.3%
 
r86448.9%
 
s74177.7%
 
m60516.3%
 
C52695.4%
 
a52165.4%
 
f46324.8%
 
41574.3%
 
i41574.3%
 
u37353.9%
 
n37353.9%
 
l36823.8%
 
p33753.5%
 
t33753.5%
 
H23162.4%
 
O23162.4%
 
c23162.4%
 
S18411.9%
 
B18411.9%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter7903781.7%
 
Uppercase Letter1358314.0%
 
Space Separator41574.3%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C526938.8%
 
H231617.1%
 
O231617.1%
 
S184113.6%
 
B184113.6%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e1174214.9%
 
o1096013.9%
 
r864410.9%
 
s74179.4%
 
m60517.7%
 
a52166.6%
 
f46325.9%
 
i41575.3%
 
u37354.7%
 
n37354.7%
 
l36824.7%
 
p33754.3%
 
t33754.3%
 
c23162.9%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
4157100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin9262095.7%
 
Common41574.3%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e1174212.7%
 
o1096011.8%
 
r86449.3%
 
s74178.0%
 
m60516.5%
 
C52695.7%
 
a52165.6%
 
f46325.0%
 
i41574.5%
 
u37354.0%
 
n37354.0%
 
l36824.0%
 
p33753.6%
 
t33753.6%
 
H23162.5%
 
O23162.5%
 
c23162.5%
 
S18412.0%
 
B18412.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
4157100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII96777100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e1174212.1%
 
o1096011.3%
 
r86448.9%
 
s74177.7%
 
m60516.3%
 
C52695.4%
 
a52165.4%
 
f46324.8%
 
41574.3%
 
i41574.3%
 
u37353.9%
 
n37353.9%
 
l36823.8%
 
p33753.5%
 
t33753.5%
 
H23162.4%
 
O23162.4%
 
c23162.4%
 
S18411.9%
 
B18411.9%
 

Product Category
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Office Supplies
5181 
Technology
2312 
Furniture
1933 
ValueCountFrequency (%) 
Office Supplies518155.0%
 
Technology231224.5%
 
Furniture193320.5%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length15
Median length15
Mean length12.54317844
Min length9

Overview of Unicode Properties

Unique unicode characters20
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e1460712.4%
 
i1229510.4%
 
f103628.8%
 
p103628.8%
 
u90477.7%
 
c74936.3%
 
l74936.3%
 
O51814.4%
 
51814.4%
 
S51814.4%
 
s51814.4%
 
o46243.9%
 
n42453.6%
 
r38663.3%
 
T23122.0%
 
h23122.0%
 
g23122.0%
 
y23122.0%
 
F19331.6%
 
t19331.6%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter9844483.3%
 
Uppercase Letter1460712.4%
 
Space Separator51814.4%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
O518135.5%
 
S518135.5%
 
T231215.8%
 
F193313.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e1460714.8%
 
i1229512.5%
 
f1036210.5%
 
p1036210.5%
 
u90479.2%
 
c74937.6%
 
l74937.6%
 
s51815.3%
 
o46244.7%
 
n42454.3%
 
r38663.9%
 
h23122.3%
 
g23122.3%
 
y23122.3%
 
t19332.0%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
5181100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin11305195.6%
 
Common51814.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e1460712.9%
 
i1229510.9%
 
f103629.2%
 
p103629.2%
 
u90478.0%
 
c74936.6%
 
l74936.6%
 
O51814.6%
 
S51814.6%
 
s51814.6%
 
o46244.1%
 
n42453.8%
 
r38663.4%
 
T23122.0%
 
h23122.0%
 
g23122.0%
 
y23122.0%
 
F19331.7%
 
t19331.7%
 

Most frequent Common characters

ValueCountFrequency (%) 
5181100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII118232100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e1460712.4%
 
i1229510.4%
 
f103628.8%
 
p103628.8%
 
u90477.7%
 
c74936.3%
 
l74936.3%
 
O51814.4%
 
51814.4%
 
S51814.4%
 
s51814.4%
 
o46243.9%
 
n42453.6%
 
r38663.3%
 
T23122.0%
 
h23122.0%
 
g23122.0%
 
y23122.0%
 
F19331.6%
 
t19331.6%
 

Product Sub-Category
Categorical

HIGH CORRELATION

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Paper
1379 
Binders and Binder Accessories
1028 
Telephones and Communication
992 
Office Furnishings
883 
Computer Peripherals
846 
Other values (12)
4298 
ValueCountFrequency (%) 
Paper137914.6%
 
Binders and Binder Accessories102810.9%
 
Telephones and Communication99210.5%
 
Office Furnishings8839.4%
 
Computer Peripherals8469.0%
 
Pens & Art Supplies7217.6%
 
Storage & Organization6106.5%
 
Appliances4925.2%
 
Chairs & Chairmats4404.7%
 
Tables4044.3%
 
Office Machines3764.0%
 
Labels3293.5%
 
Envelopes2722.9%
 
Bookcases2062.2%
 
Rubber Bands1952.1%
 
Scissors, Rulers and Trimmers1551.6%
 
Copiers and Fax981.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length30
Median length18
Mean length17.07288351
Min length5

Overview of Unicode Properties

Unique unicode characters37
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e1727010.7%
 
s133918.3%
 
i130368.1%
 
n123477.7%
 
122927.6%
 
r116187.2%
 
a107406.7%
 
o70074.4%
 
p68594.3%
 
c55363.4%
 
d45242.8%
 
t42192.6%
 
l42112.6%
 
h39772.5%
 
u37922.4%
 
m35802.2%
 
P29461.8%
 
C28161.7%
 
f25181.6%
 
B24571.5%
 
A22411.4%
 
g21031.3%
 
O18691.2%
 
&17711.1%
 
T15511.0%
 
Other values (12)62583.9%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter12903780.2%
 
Uppercase Letter1767411.0%
 
Space Separator122927.6%
 
Other Punctuation19261.2%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
P294616.7%
 
C281615.9%
 
B245713.9%
 
A224112.7%
 
O186910.6%
 
T15518.8%
 
S14868.4%
 
F9815.6%
 
M3762.1%
 
R3502.0%
 
L3291.9%
 
E2721.5%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e1727013.4%
 
s1339110.4%
 
i1303610.1%
 
n123479.6%
 
r116189.0%
 
a107408.3%
 
o70075.4%
 
p68595.3%
 
c55364.3%
 
d45243.5%
 
t42193.3%
 
l42113.3%
 
h39773.1%
 
u37922.9%
 
m35802.8%
 
f25182.0%
 
g21031.6%
 
b11230.9%
 
z6100.5%
 
v2720.2%
 
k2060.2%
 
x980.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
12292100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
&177192.0%
 
,1558.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin14671191.2%
 
Common142188.8%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e1727011.8%
 
s133919.1%
 
i130368.9%
 
n123478.4%
 
r116187.9%
 
a107407.3%
 
o70074.8%
 
p68594.7%
 
c55363.8%
 
d45243.1%
 
t42192.9%
 
l42112.9%
 
h39772.7%
 
u37922.6%
 
m35802.4%
 
P29462.0%
 
C28161.9%
 
f25181.7%
 
B24571.7%
 
A22411.5%
 
g21031.4%
 
O18691.3%
 
T15511.1%
 
S14861.0%
 
b11230.8%
 
Other values (9)34942.4%
 

Most frequent Common characters

ValueCountFrequency (%) 
1229286.5%
 
&177112.5%
 
,1551.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII160929100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e1727010.7%
 
s133918.3%
 
i130368.1%
 
n123477.7%
 
122927.6%
 
r116187.2%
 
a107406.7%
 
o70074.4%
 
p68594.3%
 
c55363.4%
 
d45242.8%
 
t42192.6%
 
l42112.6%
 
h39772.5%
 
u37922.4%
 
m35802.2%
 
P29461.8%
 
C28161.7%
 
f25181.6%
 
B24571.5%
 
A22411.4%
 
g21031.3%
 
O18691.2%
 
&17711.1%
 
T15511.0%
 
Other values (12)62583.9%
 
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Small Box
4887 
Wrap Bag
1312 
Small Pack
1067 
Jumbo Drum
703 
Jumbo Box
590 
Other values (2)
867 
ValueCountFrequency (%) 
Small Box488751.8%
 
Wrap Bag131213.9%
 
Small Pack106711.3%
 
Jumbo Drum7037.5%
 
Jumbo Box5906.3%
 
Large Box4574.8%
 
Medium Box4104.3%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length10
Median length9
Mean length9.09208572
Min length8

Overview of Unicode Properties

Unique unicode characters24
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
l1190813.9%
 
a1010211.8%
 
942611.0%
 
m83609.8%
 
B76568.9%
 
o76378.9%
 
x63447.4%
 
S59546.9%
 
r24722.9%
 
u24062.8%
 
g17692.1%
 
W13121.5%
 
p13121.5%
 
J12931.5%
 
b12931.5%
 
P10671.2%
 
c10671.2%
 
k10671.2%
 
e8671.0%
 
D7030.8%
 
L4570.5%
 
M4100.5%
 
d4100.5%
 
i4100.5%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter5742467.0%
 
Uppercase Letter1885222.0%
 
Space Separator942611.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
B765640.6%
 
S595431.6%
 
W13127.0%
 
J12936.9%
 
P10675.7%
 
D7033.7%
 
L4572.4%
 
M4102.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
l1190820.7%
 
a1010217.6%
 
m836014.6%
 
o763713.3%
 
x634411.0%
 
r24724.3%
 
u24064.2%
 
g17693.1%
 
p13122.3%
 
b12932.3%
 
c10671.9%
 
k10671.9%
 
e8671.5%
 
d4100.7%
 
i4100.7%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
9426100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin7627689.0%
 
Common942611.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
l1190815.6%
 
a1010213.2%
 
m836011.0%
 
B765610.0%
 
o763710.0%
 
x63448.3%
 
S59547.8%
 
r24723.2%
 
u24063.2%
 
g17692.3%
 
W13121.7%
 
p13121.7%
 
J12931.7%
 
b12931.7%
 
P10671.4%
 
c10671.4%
 
k10671.4%
 
e8671.1%
 
D7030.9%
 
L4570.6%
 
M4100.5%
 
d4100.5%
 
i4100.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
9426100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII85702100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
l1190813.9%
 
a1010211.8%
 
942611.0%
 
m83609.8%
 
B76568.9%
 
o76378.9%
 
x63447.4%
 
S59546.9%
 
r24722.9%
 
u24062.8%
 
g17692.1%
 
W13121.5%
 
p13121.5%
 
J12931.5%
 
b12931.5%
 
P10671.2%
 
c10671.2%
 
k10671.2%
 
e8671.0%
 
D7030.8%
 
L4570.5%
 
M4100.5%
 
d4100.5%
 
i4100.5%
 

Product Name
Categorical

HIGH CARDINALITY

Distinct1263
Distinct (%)13.4%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Global High-Back Leather Tilter, Burgundy
 
27
Bevis 36 x 72 Conference Tables
 
26
Master Giant Foot® Doorstop, Safety Yellow
 
24
BoxOffice By Design Rectangular and Half-Moon Meeting Room Tables
 
24
Wilson Jones Hanging View Binder, White, 1"
 
23
Other values (1258)
9302 
ValueCountFrequency (%) 
Global High-Back Leather Tilter, Burgundy270.3%
 
Bevis 36 x 72 Conference Tables260.3%
 
Master Giant Foot® Doorstop, Safety Yellow240.3%
 
BoxOffice By Design Rectangular and Half-Moon Meeting Room Tables240.3%
 
Wilson Jones Hanging View Binder, White, 1"230.2%
 
80 Minute CD-R Spindle, 100/Pack - Staples220.2%
 
Office Star - Mid Back Dual function Ergonomic High Back Chair with 2-Way Adjustable Arms220.2%
 
Fiskars® Softgrip Scissors220.2%
 
Peel & Seel® Recycled Catalog Envelopes, Brown220.2%
 
StarTAC 7760220.2%
 
Xerox 210210.2%
 
Bell Sonecor JB700 Caller ID210.2%
 
Global Troy™ Executive Leather Low-Back Tilter210.2%
 
Computer Printout Paper with Letter-Trim Perforations200.2%
 
Bush Westfield Collection Bookcases, Fully Assembled200.2%
 
Avery Flip-Chart Easel Binder, Black200.2%
 
80 Minute Slim Jewel Case CD-R , 10/Pack - Staples200.2%
 
Adesso Programmable 142-Key Keyboard200.2%
 
Staples 6 Outlet Surge200.2%
 
Hoover Portapower™ Portable Vacuum200.2%
 
Panasonic KX-P1150 Dot Matrix Printer200.2%
 
Staples® General Use 3-Ring Binders200.2%
 
US Robotics 56K V.92 External Faxmodem200.2%
 
Boston 1730 StandUp Electric Pencil Sharpener190.2%
 
Storex DuraTech Recycled Plastic Frosted Binders190.2%
 
Other values (1238)889194.3%
 
Frequencies of value counts

Unique

Unique48 ?
Unique (%)0.5%
Histogram of lengths of the category

Length

Max length98
Median length34
Mean length34.33131763
Min length3

Overview of Unicode Properties

Unique unicode characters84
Unique unicode categories12 ?
Unique unicode scripts2 ?
Unique unicode blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
3859311.9%
 
e290279.0%
 
r178105.5%
 
o173765.4%
 
a163845.1%
 
i157304.9%
 
l140494.3%
 
t139374.3%
 
n132394.1%
 
s128194.0%
 
c83562.6%
 
d64602.0%
 
u52401.6%
 
S51001.6%
 
C49841.5%
 
P44891.4%
 
B43151.3%
 
m41531.3%
 
g41281.3%
 
p40761.3%
 
h40251.2%
 
k37561.2%
 
135921.1%
 
y34811.1%
 
,31771.0%
 
Other values (59)6531120.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter20679663.9%
 
Uppercase Letter4804914.8%
 
Space Separator3859311.9%
 
Decimal Number186045.7%
 
Other Punctuation70912.2%
 
Dash Punctuation26080.8%
 
Other Symbol15520.5%
 
Final Punctuation81< 0.1%
 
Open Punctuation78< 0.1%
 
Close Punctuation78< 0.1%
 
Math Symbol42< 0.1%
 
Initial Punctuation35< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S510010.6%
 
C498410.4%
 
P44899.3%
 
B43159.0%
 
D29246.1%
 
A27845.8%
 
M26935.6%
 
F22544.7%
 
T22404.7%
 
R18563.9%
 
H18453.8%
 
E17933.7%
 
L17533.6%
 
I13042.7%
 
W12842.7%
 
O11562.4%
 
X11402.4%
 
G11022.3%
 
K7601.6%
 
V7021.5%
 
N6481.3%
 
U4380.9%
 
J2430.5%
 
Z870.2%
 
Y790.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e2902714.0%
 
r178108.6%
 
o173768.4%
 
a163847.9%
 
i157307.6%
 
l140496.8%
 
t139376.7%
 
n132396.4%
 
s128196.2%
 
c83564.0%
 
d64603.1%
 
u52402.5%
 
m41532.0%
 
g41282.0%
 
p40762.0%
 
h40251.9%
 
k37561.8%
 
y34811.7%
 
b29531.4%
 
x27051.3%
 
v22521.1%
 
f22101.1%
 
w20171.0%
 
j3300.2%
 
z2210.1%
 
Other values (2)62< 0.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
38593100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
1359219.3%
 
0309716.6%
 
2223912.0%
 
317139.2%
 
815938.6%
 
415718.4%
 
914157.6%
 
512746.8%
 
610655.7%
 
710455.6%
 

Most frequent Other Symbol characters

ValueCountFrequency (%) 
®101165.1%
 
54134.9%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-2608100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
,317744.8%
 
/151521.4%
 
"118616.7%
 
.5698.0%
 
&2403.4%
 
'1712.4%
 
#1121.6%
 
*681.0%
 
%480.7%
 
;50.1%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(6076.9%
 
[1823.1%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)6076.9%
 
]1823.1%
 

Most frequent Final Punctuation characters

ValueCountFrequency (%) 
81100.0%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
+42100.0%
 

Most frequent Initial Punctuation characters

ValueCountFrequency (%) 
35100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin25484578.8%
 
Common6876221.2%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e2902711.4%
 
r178107.0%
 
o173766.8%
 
a163846.4%
 
i157306.2%
 
l140495.5%
 
t139375.5%
 
n132395.2%
 
s128195.0%
 
c83563.3%
 
d64602.5%
 
u52402.1%
 
S51002.0%
 
C49842.0%
 
P44891.8%
 
B43151.7%
 
m41531.6%
 
g41281.6%
 
p40761.6%
 
h40251.6%
 
k37561.5%
 
y34811.4%
 
b29531.2%
 
D29241.1%
 
A27841.1%
 
Other values (28)3325013.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
3859356.1%
 
135925.2%
 
,31774.6%
 
030974.5%
 
-26083.8%
 
222393.3%
 
317132.5%
 
815932.3%
 
415712.3%
 
/15152.2%
 
914152.1%
 
512741.9%
 
"11861.7%
 
610651.5%
 
710451.5%
 
®10111.5%
 
.5690.8%
 
5410.8%
 
&2400.3%
 
'1710.2%
 
#1120.2%
 
810.1%
 
*680.1%
 
(600.1%
 
)600.1%
 
Other values (6)1660.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII32193799.5%
 
None10130.3%
 
Letterlike Symbols5410.2%
 
Punctuation116< 0.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
3859312.0%
 
e290279.0%
 
r178105.5%
 
o173765.4%
 
a163845.1%
 
i157304.9%
 
l140494.4%
 
t139374.3%
 
n132394.1%
 
s128194.0%
 
c83562.6%
 
d64602.0%
 
u52401.6%
 
S51001.6%
 
C49841.5%
 
P44891.4%
 
B43151.3%
 
m41531.3%
 
g41281.3%
 
p40761.3%
 
h40251.3%
 
k37561.2%
 
135921.1%
 
y34811.1%
 
,31771.0%
 
Other values (54)6364119.8%
 

Most frequent Letterlike Symbols characters

ValueCountFrequency (%) 
541100.0%
 

Most frequent None characters

ValueCountFrequency (%) 
®101199.8%
 
à20.2%
 

Most frequent Punctuation characters

ValueCountFrequency (%) 
8169.8%
 
3530.2%
 

Product Base Margin
Real number (ℝ≥0)

Distinct51
Distinct (%)0.5%
Missing72
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean0.5121894377
Minimum0.35
Maximum0.85
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum0.35
5-th percentile0.36
Q10.38
median0.52
Q30.59
95-th percentile0.78
Maximum0.85
Range0.5
Interquartile range (IQR)0.21

Descriptive statistics

Standard deviation0.1352287426
Coefficient of variation (CV)0.2640209513
Kurtosis-0.6535081841
Mean0.5121894377
Median Absolute Deviation (MAD)0.12
Skewness0.5612476362
Sum4791.02
Variance0.01828681282
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.378559.1%
 
0.387548.0%
 
0.366957.4%
 
0.595636.0%
 
0.395415.7%
 
0.575235.5%
 
0.565105.4%
 
0.44695.0%
 
0.584304.6%
 
0.553553.8%
 
0.63293.5%
 
0.352963.1%
 
0.521721.8%
 
0.741191.3%
 
0.431171.2%
 
0.411151.2%
 
0.651111.2%
 
0.641101.2%
 
0.441071.1%
 
0.781001.1%
 
0.491001.1%
 
0.75931.0%
 
0.83901.0%
 
0.48890.9%
 
0.66890.9%
 
Other values (26)162217.2%
 
ValueCountFrequency (%) 
0.352963.1%
 
0.366957.4%
 
0.378559.1%
 
0.387548.0%
 
0.395415.7%
 
0.44695.0%
 
0.411151.2%
 
0.42860.9%
 
0.431171.2%
 
0.441071.1%
 
ValueCountFrequency (%) 
0.85400.4%
 
0.84290.3%
 
0.83901.0%
 
0.82340.4%
 
0.81820.9%
 
0.8550.6%
 
0.79740.8%
 
0.781001.1%
 
0.77770.8%
 
0.76600.6%
 

Region
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Central
2899 
East
2289 
West
2284 
South
1954 
ValueCountFrequency (%) 
Central289930.8%
 
East228924.3%
 
West228424.2%
 
South195420.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length7
Median length5
Mean length5.129959686
Min length4

Overview of Unicode Properties

Unique unicode characters14
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
t942619.5%
 
a518810.7%
 
e518310.7%
 
s45739.5%
 
C28996.0%
 
n28996.0%
 
r28996.0%
 
l28996.0%
 
E22894.7%
 
W22844.7%
 
S19544.0%
 
o19544.0%
 
u19544.0%
 
h19544.0%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter3892980.5%
 
Uppercase Letter942619.5%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C289930.8%
 
E228924.3%
 
W228424.2%
 
S195420.7%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
t942624.2%
 
a518813.3%
 
e518313.3%
 
s457311.7%
 
n28997.4%
 
r28997.4%
 
l28997.4%
 
o19545.0%
 
u19545.0%
 
h19545.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin48355100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
t942619.5%
 
a518810.7%
 
e518310.7%
 
s45739.5%
 
C28996.0%
 
n28996.0%
 
r28996.0%
 
l28996.0%
 
E22894.7%
 
W22844.7%
 
S19544.0%
 
o19544.0%
 
u19544.0%
 
h19544.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII48355100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
t942619.5%
 
a518810.7%
 
e518310.7%
 
s45739.5%
 
C28996.0%
 
n28996.0%
 
r28996.0%
 
l28996.0%
 
E22894.7%
 
W22844.7%
 
S19544.0%
 
o19544.0%
 
u19544.0%
 
h19544.0%
 

State or Province
Categorical

HIGH CORRELATION

Distinct49
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
California
1021 
Texas
646 
Illinois
 
584
New York
 
574
Florida
 
522
Other values (44)
6079 
ValueCountFrequency (%) 
California102110.8%
 
Texas6466.9%
 
Illinois5846.2%
 
New York5746.1%
 
Florida5225.5%
 
Ohio3964.2%
 
Washington3273.5%
 
Michigan3273.5%
 
Pennsylvania2712.9%
 
North Carolina2512.7%
 
Indiana2412.6%
 
Minnesota2392.5%
 
Massachusetts2222.4%
 
Georgia2142.3%
 
Virginia1982.1%
 
Maryland1781.9%
 
Colorado1771.9%
 
New Jersey1771.9%
 
Wisconsin1691.8%
 
Oregon1681.8%
 
Tennessee1661.8%
 
Missouri1611.7%
 
Iowa1561.7%
 
Utah1461.5%
 
Arizona1341.4%
 
Other values (24)176118.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length20
Median length8
Mean length8.298005517
Min length4

Overview of Unicode Properties

Unique unicode characters46
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a985612.6%
 
i901011.5%
 
n70269.0%
 
o67928.7%
 
s52996.8%
 
r46065.9%
 
e43725.6%
 
l40255.1%
 
t21722.8%
 
h21282.7%
 
C17042.2%
 
15061.9%
 
M14661.9%
 
d13151.7%
 
g12981.7%
 
N12941.7%
 
c11171.4%
 
I11151.4%
 
f10891.4%
 
w10601.4%
 
k10231.3%
 
u8381.1%
 
T8121.0%
 
y7300.9%
 
x7300.9%
 
Other values (21)58347.5%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter6584784.2%
 
Uppercase Letter1086413.9%
 
Space Separator15061.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C170415.7%
 
M146613.5%
 
N129411.9%
 
I111510.3%
 
T8127.5%
 
O6686.1%
 
Y5745.3%
 
W5605.2%
 
F5224.8%
 
A3823.5%
 
V3022.8%
 
P2712.5%
 
K2162.0%
 
G2142.0%
 
J1771.6%
 
U1461.3%
 
D1451.3%
 
S1331.2%
 
L890.8%
 
H540.5%
 
R200.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a985615.0%
 
i901013.7%
 
n702610.7%
 
o679210.3%
 
s52998.0%
 
r46067.0%
 
e43726.6%
 
l40256.1%
 
t21723.3%
 
h21283.2%
 
d13152.0%
 
g12982.0%
 
c11171.7%
 
f10891.7%
 
w10601.6%
 
k10231.6%
 
u8381.3%
 
y7301.1%
 
x7301.1%
 
m4330.7%
 
v3140.5%
 
b2700.4%
 
p2100.3%
 
z1340.2%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
1506100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin7671198.1%
 
Common15061.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a985612.8%
 
i901011.7%
 
n70269.2%
 
o67928.9%
 
s52996.9%
 
r46066.0%
 
e43725.7%
 
l40255.2%
 
t21722.8%
 
h21282.8%
 
C17042.2%
 
M14661.9%
 
d13151.7%
 
g12981.7%
 
N12941.7%
 
c11171.5%
 
I11151.5%
 
f10891.4%
 
w10601.4%
 
k10231.3%
 
u8381.1%
 
T8121.1%
 
y7301.0%
 
x7301.0%
 
O6680.9%
 
Other values (20)51666.7%
 

Most frequent Common characters

ValueCountFrequency (%) 
1506100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII78217100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a985612.6%
 
i901011.5%
 
n70269.0%
 
o67928.7%
 
s52996.8%
 
r46065.9%
 
e43725.6%
 
l40255.1%
 
t21722.8%
 
h21282.7%
 
C17042.2%
 
15061.9%
 
M14661.9%
 
d13151.7%
 
g12981.7%
 
N12941.7%
 
c11171.4%
 
I11151.4%
 
f10891.4%
 
w10601.4%
 
k10231.3%
 
u8381.1%
 
T8121.0%
 
y7300.9%
 
x7300.9%
 
Other values (21)58347.5%
 

City
Categorical

HIGH CARDINALITY

Distinct1424
Distinct (%)15.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
New York City
 
202
Los Angeles
 
196
Seattle
 
93
Chicago
 
90
Boston
 
80
Other values (1419)
8765 
ValueCountFrequency (%) 
New York City2022.1%
 
Los Angeles1962.1%
 
Seattle931.0%
 
Chicago901.0%
 
Boston800.8%
 
Washington680.7%
 
Philadelphia580.6%
 
Miami500.5%
 
Charlotte470.5%
 
Houston460.5%
 
Detroit420.4%
 
Atlanta400.4%
 
Dallas380.4%
 
San Francisco370.4%
 
San Diego360.4%
 
Springfield310.3%
 
Columbus310.3%
 
Auburn260.3%
 
Roswell250.3%
 
Marion240.3%
 
Sanford230.2%
 
Clinton220.2%
 
Boise220.2%
 
Twentynine Palms220.2%
 
Burlington220.2%
 
Other values (1399)805585.5%
 
Frequencies of value counts

Unique

Unique151 ?
Unique (%)1.6%
Histogram of lengths of the category

Length

Max length19
Median length9
Mean length9.170698069
Min length3

Overview of Unicode Properties

Unique unicode characters52
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e81359.4%
 
a74138.6%
 
o65177.5%
 
n62137.2%
 
l55976.5%
 
i52426.1%
 
r52326.1%
 
t47335.5%
 
s39994.6%
 
34364.0%
 
d20492.4%
 
u18542.1%
 
g18182.1%
 
h17912.1%
 
C15401.8%
 
c14001.6%
 
k13651.6%
 
y12891.5%
 
m11471.3%
 
S11331.3%
 
w11021.3%
 
P10041.2%
 
v10031.2%
 
B9231.1%
 
M8941.0%
 
Other values (27)961411.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter7014581.1%
 
Uppercase Letter1286214.9%
 
Space Separator34364.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C154012.0%
 
S11338.8%
 
P10047.8%
 
B9237.2%
 
M8947.0%
 
L8846.9%
 
H7445.8%
 
A7015.5%
 
W6234.8%
 
R5484.3%
 
N5324.1%
 
D4513.5%
 
F4473.5%
 
G4423.4%
 
O3472.7%
 
E3252.5%
 
T3212.5%
 
Y2351.8%
 
V2231.7%
 
K2101.6%
 
I1411.1%
 
J1251.0%
 
U470.4%
 
Q160.1%
 
X6< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e813511.6%
 
a741310.6%
 
o65179.3%
 
n62138.9%
 
l55978.0%
 
i52427.5%
 
r52327.5%
 
t47336.7%
 
s39995.7%
 
d20492.9%
 
u18542.6%
 
g18182.6%
 
h17912.6%
 
c14002.0%
 
k13651.9%
 
y12891.8%
 
m11471.6%
 
w11021.6%
 
v10031.4%
 
p7591.1%
 
b7041.0%
 
f4760.7%
 
x1520.2%
 
z730.1%
 
q670.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
3436100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin8300796.0%
 
Common34364.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e81359.8%
 
a74138.9%
 
o65177.9%
 
n62137.5%
 
l55976.7%
 
i52426.3%
 
r52326.3%
 
t47335.7%
 
s39994.8%
 
d20492.5%
 
u18542.2%
 
g18182.2%
 
h17912.2%
 
C15401.9%
 
c14001.7%
 
k13651.6%
 
y12891.6%
 
m11471.4%
 
S11331.4%
 
w11021.3%
 
P10041.2%
 
v10031.2%
 
B9231.1%
 
M8941.1%
 
L8841.1%
 
Other values (26)873010.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
3436100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII86443100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e81359.4%
 
a74138.6%
 
o65177.5%
 
n62137.2%
 
l55976.5%
 
i52426.1%
 
r52326.1%
 
t47335.5%
 
s39994.6%
 
34364.0%
 
d20492.4%
 
u18542.1%
 
g18182.1%
 
h17912.1%
 
C15401.8%
 
c14001.6%
 
k13651.6%
 
y12891.5%
 
m11471.3%
 
S11331.3%
 
w11021.3%
 
P10041.2%
 
v10031.2%
 
B9231.1%
 
M8941.0%
 
Other values (27)961411.1%
 

Postal Code
Real number (ℝ≥0)

Distinct1697
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52446.32729
Minimum1001
Maximum99362
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum1001
5-th percentile5451
Q129406
median52302
Q378516
95-th percentile97035
Maximum99362
Range98361
Interquartile range (IQR)49110

Descriptive statistics

Standard deviation29374.5978
Coefficient of variation (CV)0.5600887483
Kurtosis-1.208888975
Mean52446.32729
Median Absolute Deviation (MAD)24714
Skewness-0.04255795839
Sum494359081
Variance862866996
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
10177540.6%
 
90049470.5%
 
20016370.4%
 
60601360.4%
 
90045330.4%
 
2113290.3%
 
98115270.3%
 
90041260.3%
 
30318240.3%
 
98119230.2%
 
77070230.2%
 
94110220.2%
 
92277220.2%
 
28206210.2%
 
88201210.2%
 
19112200.2%
 
98103200.2%
 
81301200.2%
 
59715190.2%
 
28204190.2%
 
55372190.2%
 
92037180.2%
 
2118180.2%
 
87105180.2%
 
75220170.2%
 
Other values (1672)879393.3%
 
ValueCountFrequency (%) 
10011< 0.1%
 
10071< 0.1%
 
10131< 0.1%
 
10271< 0.1%
 
10281< 0.1%
 
10401< 0.1%
 
10561< 0.1%
 
10601< 0.1%
 
10691< 0.1%
 
10752< 0.1%
 
ValueCountFrequency (%) 
9936280.1%
 
9935250.1%
 
9933660.1%
 
9930160.1%
 
9920770.1%
 
9916370.1%
 
989023< 0.1%
 
9880170.1%
 
9866180.1%
 
9863260.1%
 
Distinct1419
Distinct (%)15.1%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Minimum2010-01-01 00:00:00
Maximum2013-12-31 00:00:00
Histogram with fixed size bins (bins=50)
Distinct1450
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size73.8 KiB
Minimum2010-01-02 00:00:00
Maximum2014-01-17 00:00:00
Histogram with fixed size bins (bins=50)

Profit
Real number (ℝ)

Distinct8984
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean139.2364099
Minimum-16476.838
Maximum16332.414
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum-16476.838
5-th percentile-545.82
Q1-74.017375
median2.5676
Q3140.24385
95-th percentile1322.366025
Maximum16332.414
Range32809.252
Interquartile range (IQR)214.261225

Descriptive statistics

Standard deviation998.4864829
Coefficient of variation (CV)7.171159353
Kurtosis56.53434226
Mean139.2364099
Median Absolute Deviation (MAD)96.34976
Skewness0.8414114744
Sum1312442.399
Variance996975.2566
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
-969.0483664< 0.1%
 
11.650954< 0.1%
 
33.988954< 0.1%
 
-1356.6677123< 0.1%
 
-56.353< 0.1%
 
5.713< 0.1%
 
18.273< 0.1%
 
-44.863< 0.1%
 
6.113< 0.1%
 
6.793< 0.1%
 
-17.493< 0.1%
 
-106.4213< 0.1%
 
-1331.5533663< 0.1%
 
-433.2901433< 0.1%
 
-33.113< 0.1%
 
-2.233< 0.1%
 
-715.7782063< 0.1%
 
-513.790423< 0.1%
 
-5.413< 0.1%
 
-39.923< 0.1%
 
-48.973< 0.1%
 
-60.5642< 0.1%
 
772.042< 0.1%
 
19.042< 0.1%
 
221.632< 0.1%
 
Other values (8959)935299.2%
 
ValueCountFrequency (%) 
-16476.8381< 0.1%
 
-14369.123581< 0.1%
 
-14140.70161< 0.1%
 
-13706.4641< 0.1%
 
-13562.637411< 0.1%
 
-10402.943921< 0.1%
 
-10263.65971< 0.1%
 
-9078.941< 0.1%
 
-8570.44831< 0.1%
 
-7961.43091< 0.1%
 
ValueCountFrequency (%) 
16332.4141< 0.1%
 
12504.90451< 0.1%
 
11429.47741< 0.1%
 
9228.22561< 0.1%
 
9195.9751< 0.1%
 
8917.71871< 0.1%
 
8798.18311< 0.1%
 
8752.04281< 0.1%
 
8202.56821< 0.1%
 
8118.89191< 0.1%
 

Quantity ordered new
Real number (ℝ≥0)

Distinct112
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.79842987
Minimum1
Maximum170
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q15
median10
Q317
95-th percentile42
Maximum170
Range169
Interquartile range (IQR)12

Descriptive statistics

Standard deviation15.10768777
Coefficient of variation (CV)1.094884557
Kurtosis19.0047279
Mean13.79842987
Median Absolute Deviation (MAD)5
Skewness3.526649962
Sum130064
Variance228.2422299
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
16286.7%
 
54995.3%
 
94995.3%
 
114985.3%
 
124985.3%
 
34905.2%
 
84875.2%
 
24825.1%
 
44484.8%
 
74464.7%
 
104444.7%
 
64384.6%
 
133623.8%
 
143213.4%
 
162692.9%
 
152572.7%
 
182292.4%
 
172142.3%
 
201781.9%
 
191741.8%
 
211571.7%
 
221431.5%
 
231361.4%
 
24840.9%
 
25690.7%
 
Other values (87)97610.4%
 
ValueCountFrequency (%) 
16286.7%
 
24825.1%
 
34905.2%
 
44484.8%
 
54995.3%
 
64384.6%
 
74464.7%
 
84875.2%
 
94995.3%
 
104444.7%
 
ValueCountFrequency (%) 
1701< 0.1%
 
1672< 0.1%
 
1621< 0.1%
 
1601< 0.1%
 
1551< 0.1%
 
1511< 0.1%
 
1481< 0.1%
 
1463< 0.1%
 
1393< 0.1%
 
1372< 0.1%
 

Sales
Real number (ℝ≥0)

Distinct8674
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean949.706272
Minimum1.32
Maximum100119.16
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum1.32
5-th percentile14.52
Q161.2825
median203.455
Q3776.4025
95-th percentile4209.38
Maximum100119.16
Range100117.84
Interquartile range (IQR)715.12

Descriptive statistics

Standard deviation2598.019818
Coefficient of variation (CV)2.735603517
Kurtosis300.1297924
Mean949.706272
Median Absolute Deviation (MAD)174.405
Skewness12.21253075
Sum8951931.32
Variance6749706.976
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
80.584< 0.1%
 
46.854< 0.1%
 
10.484< 0.1%
 
14.534< 0.1%
 
119.864< 0.1%
 
33.534< 0.1%
 
9.643< 0.1%
 
52.933< 0.1%
 
8.493< 0.1%
 
21.463< 0.1%
 
8.743< 0.1%
 
10.233< 0.1%
 
20.833< 0.1%
 
72.773< 0.1%
 
10.143< 0.1%
 
36.163< 0.1%
 
106.653< 0.1%
 
42.023< 0.1%
 
28.463< 0.1%
 
23.563< 0.1%
 
67.53< 0.1%
 
20.513< 0.1%
 
200.643< 0.1%
 
39.153< 0.1%
 
16.63< 0.1%
 
Other values (8649)934599.1%
 
ValueCountFrequency (%) 
1.321< 0.1%
 
1.621< 0.1%
 
1.651< 0.1%
 
2.241< 0.1%
 
2.252< 0.1%
 
2.661< 0.1%
 
2.741< 0.1%
 
2.771< 0.1%
 
2.871< 0.1%
 
3.071< 0.1%
 
ValueCountFrequency (%) 
100119.161< 0.1%
 
50332.661< 0.1%
 
48418.581< 0.1%
 
45737.331< 0.1%
 
43046.21< 0.1%
 
40136.931< 0.1%
 
36532.461< 0.1%
 
35147.331< 0.1%
 
32589.591< 0.1%
 
32510.211< 0.1%
 

Order ID
Real number (ℝ≥0)

HIGH CORRELATION

Distinct6455
Distinct (%)68.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean82318.48907
Minimum6
Maximum91591
Zeros0
Zeros (%)0.0%
Memory size73.8 KiB

Quantile statistics

Minimum6
5-th percentile28822.25
Q186737.25
median88344.5
Q389987.75
95-th percentile91256
Maximum91591
Range91585
Interquartile range (IQR)3250.5

Descriptive statistics

Standard deviation19149.44886
Coefficient of variation (CV)0.2326263404
Kurtosis6.9533914
Mean82318.48907
Median Absolute Deviation (MAD)1628
Skewness-2.860264943
Sum775934078
Variance366701391.5
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
4374560.1%
 
9057160.1%
 
8694350.1%
 
9083950.1%
 
8651050.1%
 
8698350.1%
 
9011350.1%
 
8703350.1%
 
9008250.1%
 
8696050.1%
 
8693950.1%
 
4845250.1%
 
8902750.1%
 
8986750.1%
 
8958050.1%
 
4252850.1%
 
8806050.1%
 
9148550.1%
 
22094< 0.1%
 
868384< 0.1%
 
867454< 0.1%
 
881784< 0.1%
 
910094< 0.1%
 
860814< 0.1%
 
897654< 0.1%
 
Other values (6430)930698.7%
 
ValueCountFrequency (%) 
61< 0.1%
 
1931< 0.1%
 
3222< 0.1%
 
3582< 0.1%
 
3591< 0.1%
 
3862< 0.1%
 
3881< 0.1%
 
4541< 0.1%
 
5483< 0.1%
 
6122< 0.1%
 
ValueCountFrequency (%) 
915913< 0.1%
 
915902< 0.1%
 
915892< 0.1%
 
915881< 0.1%
 
915871< 0.1%
 
915861< 0.1%
 
915852< 0.1%
 
915841< 0.1%
 
915831< 0.1%
 
915821< 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

Row IDOrder PriorityDiscountUnit PriceShipping CostCustomer IDCustomer NameShip ModeCustomer SegmentProduct CategoryProduct Sub-CategoryProduct ContainerProduct NameProduct Base MarginRegionState or ProvinceCityPostal CodeOrder DateShip DateProfitQuantity ordered newSalesOrder ID
018606Not Specified0.012.880.502Janice FletcherRegular AirCorporateOffice SuppliesLabelsSmall BoxAvery 490.36CentralIllinoisAddison601012012-05-282012-05-301.320025.9088525
120847High0.012.840.933Bonnie PotterExpress AirCorporateOffice SuppliesPens & Art SuppliesWrap BagSANFORD Liquid Accent™ Tank-Style Highlighters0.54WestWashingtonAnacortes982212010-07-072010-07-084.5600413.0188522
223086Not Specified0.036.686.153Bonnie PotterExpress AirCorporateOffice SuppliesPaperSmall BoxXerox 19680.37WestWashingtonAnacortes982212011-07-272011-07-28-47.6400749.9288523
323087Not Specified0.015.683.603Bonnie PotterRegular AirCorporateOffice SuppliesScissors, Rulers and TrimmersSmall PackAcme® Preferred Stainless Steel Scissors0.56WestWashingtonAnacortes982212011-07-272011-07-28-30.5100741.6488523
423088Not Specified0.00205.992.503Bonnie PotterExpress AirCorporateTechnologyTelephones and CommunicationSmall BoxV700.59WestWashingtonAnacortes982212011-07-272011-07-27998.202381446.6788523
523597Medium0.0955.4814.303Bonnie PotterExpress AirCorporateOffice SuppliesPaperSmall BoxXerox 1940.37WestWashingtonAnacortes982212011-11-092011-11-111388.0523372011.6788524
625549Low0.08120.9726.303Bonnie PotterDelivery TruckCorporateTechnologyOffice MachinesJumbo DrumCanon S750 Color Inkjet Printer0.38WestWashingtonAnacortes982212013-07-012013-07-081001.4453121451.3788526
720228Not Specified0.02500.9826.005Ronnie ProctorDelivery TruckHome OfficeFurnitureChairs & ChairmatsJumbo DrumGlobal Troy™ Executive Leather Low-Back Tilter0.60WestCaliforniaSan Gabriel917762010-12-132010-12-154390.3665126362.8590193
819483Low0.086.486.815Ronnie ProctorRegular AirHome OfficeOffice SuppliesPaperSmall BoxXerox 19300.36WestCaliforniaSan Gabriel917762012-05-122012-05-21-141.260018113.2590197
924782High0.0190.240.996Dwight HwangRegular AirHome OfficeOffice SuppliesAppliancesSmall BoxKensington 6 Outlet MasterPiece® HOMEOFFICE Power Control Center0.56WestCaliforniaSan Jose951232011-05-262011-05-261045.4673161515.1790194

Last rows

Row IDOrder PriorityDiscountUnit PriceShipping CostCustomer IDCustomer NameShip ModeCustomer SegmentProduct CategoryProduct Sub-CategoryProduct ContainerProduct NameProduct Base MarginRegionState or ProvinceCityPostal CodeOrder DateShip DateProfitQuantity ordered newSalesOrder ID
941624933Medium0.017.645.833400Florence GoldRegular AirCorporateOffice SuppliesPaperWrap BagRediform Wirebound "Phone Memo" Message Book, 11 x 5-3/40.36EastWest VirginiaFairmont265542012-11-292012-11-293.312000867.5087545
941723413Critical0.07115.791.993400Florence GoldRegular AirCorporateTechnologyComputer PeripheralsSmall PackVerbatim DVD-R, 4.7GB, Spindle, WE, Blank, Ink Jet/Thermal, 20/Spindle0.49EastWest VirginiaFairmont265542013-04-272013-04-29606.3099008878.7187546
941823414Critical0.0237.444.273400Florence GoldRegular AirCorporateOffice SuppliesPens & Art SuppliesWrap BagSanford Prismacolor® Professional Thick Lead Art Pencils, 36-Color Set0.46EastWest VirginiaFairmont265542013-04-272013-04-30-15.030000143.9087546
941924912Medium0.0535.991.253400Florence GoldRegular AirSmall BusinessTechnologyTelephones and CommunicationSmall PackAccessory230.36EastWest VirginiaFairmont265542013-10-042013-10-05411.48150019596.3587549
942018329High0.08212.6052.203402Frederick ColeDelivery TruckConsumerFurnitureTablesJumbo BoxBush Advantage Collection® Round Conference Table0.64EastWest VirginiaCharleston253142011-04-122011-04-12-513.790420101969.3187531
942120275Critical0.0635.8914.723402Frederick ColeRegular AirConsumerOffice SuppliesEnvelopesSmall BoxJet-Pak Recycled Peel 'N' Seal Padded Mailers0.40EastWest VirginiaCharleston253142013-05-142013-05-15137.86000013447.8787532
942220276Critical0.003.347.493402Frederick ColeRegular AirConsumerOffice SuppliesPens & Art SuppliesWrap BagEldon Spacemaker® Box, Quick-Snap Lid, Clear0.54EastWest VirginiaCharleston253142013-05-142013-05-14-39.070000313.2387532
942324491Not Specified0.08550.9845.703402Frederick ColeDelivery TruckConsumerFurnitureTablesJumbo BoxChromcraft Bull-Nose Wood Oval Conference Tables & Bases0.71EastWest VirginiaCharleston253142013-09-122013-09-14-1225.02909742215.9387533
942425914High0.10105.9813.993403Tammy BuckleyExpress AirConsumerFurnitureOffice FurnishingsMedium BoxTenex 46" x 60" Computer Anti-Static Chairmat, Rectangular Shaped0.65WestWyomingCheyenne820012010-02-082010-02-11349.4850005506.5087530
942524492Not Specified0.097.782.503403Tammy BuckleyExpress AirConsumerOffice SuppliesEnvelopesSmall BoxStaples #10 Colored Envelopes0.38WestWyomingCheyenne820012013-09-122013-09-1478.06240023172.4887533